Improving Low-Resource Speech Recognition Based on Improved NN-HMM Structures
نویسندگان
چکیده
منابع مشابه
Hybrid NN/HMM acoustic modeling techniques for distributed speech recognition
Distributed speech recognition (DSR) where the recognizer is split up into two parts and connected via a transmission channel offers new perspectives for improving the speech recognition performance in mobile environments. In this work, we present the integration of hybrid acoustic models using tied posteriors in a distributed environment. A comparison with standard Gaussian models is performed...
متن کاملImproved feature decorrelation for HMM-based speech recognition
In most HMM-based recognition systems, a mixture of diagonal covariance gaussians is used to model the observation density functions in the states. The use of diagonal covariance gaussians however assumes that the underlying data vectors have uncorrelated vector components: if each gaussian is replaced with its full covariant counterpart, the off-diagonal elements in the covariance matrices sho...
متن کاملAn improved training algorithm in HMM-based speech recognition
In HMM-based speech recognition, estimation of parameters of HMMs is viewed as counterpart of training or learning in traditional sequential pattern recognition since speech signal can be represented by a sequence of ndimension vectors after features are extracted from the speech signal. However, due to variation of duration of the phone with speakers and context and its randomness, speech samp...
متن کاملLanguage Adaptive DNNs for Improved Low Resource Speech Recognition
Deep Neural Network (DNN) acoustic models are commonly used in today’s state-of-the-art speech recognition systems. As neural networks are a data driven method, the amount of available training data directly impacts the performance. In the past, several studies have shown that multilingual training of DNNs leads to improvements, especially in resource constrained tasks in which only limited tra...
متن کاملLow memory acoustic models for HMM based speech recognition
In this paper, we propose a new approach to reduce the memory footprint of HMM based ASR systems. The proposed method involves three steps. Starting from the continuous density HMMs, mixture variances are tied using k-means based vector quantization. Next, the reestimation of the resulted models is performed with tied variances. Finally, scalar quantization is performed for the mean components ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
ژورنال
عنوان ژورنال: IEEE Access
سال: 2020
ISSN: 2169-3536
DOI: 10.1109/access.2020.2988365